Picture for Chen Zhang

Chen Zhang

SenseTime Research

CICADA: Cross-Domain Interpretable Coding for Anomaly Detection and Adaptation in Multivariate Time Series

Add code
May 01, 2025
Viaarxiv icon

Towards Flow-Matching-based TTS without Classifier-Free Guidance

Add code
Apr 29, 2025
Viaarxiv icon

From Human Memory to AI Memory: A Survey on Memory Mechanisms in the Era of LLMs

Add code
Apr 22, 2025
Viaarxiv icon

HistLLM: A Unified Framework for LLM-Based Multimodal Recommendation with User History Encoding and Compression

Add code
Apr 14, 2025
Viaarxiv icon

MiLiC-Eval: Benchmarking Multilingual LLMs for China's Minority Languages

Add code
Mar 03, 2025
Viaarxiv icon

Sparse Alignment Enhanced Latent Diffusion Transformer for Zero-Shot Speech Synthesis

Add code
Feb 26, 2025
Viaarxiv icon

KAPPA: A Generic Patent Analysis Framework with Keyphrase-Based Portraits

Add code
Feb 18, 2025
Viaarxiv icon

HumanDiT: Pose-Guided Diffusion Transformer for Long-form Human Motion Video Generation

Add code
Feb 10, 2025
Figure 1 for HumanDiT: Pose-Guided Diffusion Transformer for Long-form Human Motion Video Generation
Figure 2 for HumanDiT: Pose-Guided Diffusion Transformer for Long-form Human Motion Video Generation
Figure 3 for HumanDiT: Pose-Guided Diffusion Transformer for Long-form Human Motion Video Generation
Figure 4 for HumanDiT: Pose-Guided Diffusion Transformer for Long-form Human Motion Video Generation
Viaarxiv icon

A Survey on Multi-Turn Interaction Capabilities of Large Language Models

Add code
Jan 17, 2025
Viaarxiv icon

Data and System Perspectives of Sustainable Artificial Intelligence

Add code
Jan 13, 2025
Viaarxiv icon